o 



0 



Raw Sequence Listing Error Summary 



O 

o 



3> 



ERROR DETECTED 



SUGGESTED C ORRECTIO N SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA* HEADERS, WHICH WERE INSERTED BY PTO SO^^ARg <^ 



m ^ 



_Wrapped Nucleics The numberAcxt at the end or each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping." 

Jnvalid Line Length The rules require that a line not exceed 72 characters in length, This includes white spaces. 



o 

CO 

o 



^Misaligned Amino 
Numbering 

Non- ASCII 




6 \S Patentln 2.0 
"bug" 



Skipped Sequences 



(OLD RULES) 



8 Skipped Sequences 

r„- (NEW RULES) 



Use of n's or Xaa's 
"(NEW RULES) 



j0 invalid <2i3> 
Response 



11 



Useof<220> 



12 



_PatentIn 2.0 
"bug" 



The numbering under each 5^ amino acid is misaligned Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was noi saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission Is saved In ASCII text 

Sequence^) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section-that some may be missing. 



A "bug" in Patent! 
sequencers) 



version 2.0 has caused the <220>-<223> section to be missing from amino acid 
Normally, Patentln would automatically generate this section from the 



previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences.' 

Sequences) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where M X"Js shown) 
This sequence is intentionally skipped 

Please also adjust the M (ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 



Sequencers) _ 



_ missing. If Intentional, please insert the following lines for each skipped sequence. 



<2I0> sequence id number 
<400> sequence id number 
000 

Use of n*s and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present 

In <220> to <223> section, please explain location of la or Xaa, and which residue n or Xaa represents, 

a' 

Per i.823 of Sequence i&icS, the only valid <2I3> responses arc: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence ' t 



Sequencers) m 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <2I3> -Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See -Federal Register," 06701/1998, VoL 63, No. 104, pp. 29631-32) (Sec 1.823 of Sequence Rules) 

i 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 



AMC/MH - Biotechnology Systems Branch - 08/2 1/2001 




BIOTECHNOLOGY 
SYSTEMS 
BRANCH 



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing tine following computer readable 
form: 




Application Serial Number: 09 / Cyffl/Z^lA 

Source: tyQp 

Date Processed by STIC: fy/iuXoQX 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@nspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www,uspto>gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<htifr>://wvmonspto.gov , EFS Submission 
User Manna! - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
20 1 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 
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4- 



•a 
j 


<110> 


APPLICANT 


: VAISVILA, ROMUALDAS 












Do@s 


4 






MORGAN, RICHARD D 


















0@rr 




5 






KUCERA, REBECCA B 






















0 






CLAUS, 


TOBY E 
























7 






RALEIGH, ELISABETH A. 




















9 


<120> 


TITLE 


OF INVENTION: METHOD FOR CLONING AND PRODUCING THE Mi 


xu 






ENDONUCLEASE 
























12 


<130> 


FILE REFERENCE: NEB- 181 


















1 A 

14 


<140> 


CURRENT APPLICATION NUMBER: 09/689, 343A 










X D 


<141> 


CURRENT FILING DATE: 


2000- 


■10-12 














X / 


<160> 


NUMBER OF 


SEQ 


ID NOS: 


9 


















xy 


<170> 


SOFTWARE : 


Patentln Ver. 2 


n 


















<210> 


SEQ ID NO 


: 1 
























22 


<211> 


LENGTH: 903 
























23 


<212> 


TYPE: 


DNA 




























<213> 


ORGANISM: 


Micrococcus 


sp. 


















c 
ZD 


<220> 


FEATURE : 


























27 


<221> 


NAME/KEY : 


CDS 
























2o 


<222> 


LOCATION: 


(1) 


. (900) 




















30 


<400> 


SEQUENCE : 


1 
























31 


atg 


cct 


ate 


teg 


ace 


gtc 


tgg 


acg 


ccg 


gac 


gga 


gac 


gac 


etc 


ate 


gtg 




Met 


Pro 


He 


Ser 


Thr 


Val 


Trp 


Thr 


Pro 


Asp 


Gly 


Asp 


Asp 


Leu 


He 


Val 


3 J 


1 










5 










10 










15 




35 


gag 


gcg 


gac 


aac 


etc 


gat 


ttc 


att 


caa 


acg 


etc 


ccc 


gac 


gcg 


age 


ttc 


36 


Glu 


Ala 


Asp 


Asn 


Leu 


Asp 


Phe 


He 


Gin 


Thr 


Leu 


Pro 


Asp 


Ala 


Ser 


Phe 


37 










20 










25 










30 






39 


cga 


atg 


ate 


tac 


ate 


gat 


ccg 


ccg 


ttc 


aac 


aca 


ggg 


cga 


acg 


cag 


egg 


40 


Arg 


Met 


He 


Tyr 


He 


Asp 


Pro 


Pro 


Phe 


Asn 


Thr 


Gly 


Arg 


Thr 


Gin 


Arg 


41 








35 










40 










45 








43 


ctt 


cag 


teg 


etc 


aag 


acg 


acc 


cgc 


teg 


gtc 


aca 


ggg 


teg 


cga 


gtc 


ggc 


44 


Leu 


Gin 


Ser 


Leu 


Lys 


Thr 


Thr 


Arg 


Ser 


Val 


Thr 


Gly 


Ser 


Arg 


Val 


Gly 


45 




50 










55 










60 










47 


ttc 


aaa 


ggc 


cag 


acg 


tac 


gac 


acg 


gtc 


aag 


age 


act 


ctg 


cac 


teg 


tat 


48 


Phe 


Lys 


Gly 


Gin 


Thr 


Tyr 


Asp 


Thr 


Val 


Lys 


Ser 


Thr 


Leu 


His 


Ser 


Tyr 


49 


65 












70 










75 










80 


51 


gac 


gac 


get 


ttc 


ace 


gac 


tat 


tgg 


teg 


ttc 


etc 


gaa 


ccg 


cgt 


etc 


ctg 


52 


Asp 


Asp 


Ala 


Phe 


Thr 


Asp 


Tyr 


Trp 


Ser 


Phe 


Leu 


Glu 


Pro 


Arg 


Leu 


Leu 


53 












85 










90 










95 




55 


gag 


get 


tgg 


egg 


ttg 


etc 


acc 


cct 


gac 


ggc 


gcg 


etc 


tat 


ctt 


cat 


ctg 


56 


Glu 


Ala 


Trp 


Arg 


Leu 


Leu 


Thr 


Pro 


Asp 


Gly 


Ala 


Leu 


Tyr 


Leu 


His 


Leu 


57 










100 










105 










110 






59 


gat 


tac 


cgc 


gag 


gtt 


cac 


tac 


gec 


aag 


gtc 


gtc 


etc 


gac 


gcg 


atg 


ttc 


60 


Asp 


Tyr 


Arg 


Glu 


Val 


His 


Tyr 


Ala 


Lys 


Val 


Val 


Leu 


Asp 


Ala 


Met 


Phe 



Not Comply 
Diskette Weeded 

sel RESTRICTION 



48 



96 



144 



192 



240 



288 



336 



384 
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61 






115 










120 










125 










63 


aa?\ 
y y a 


cgc 


gaa 


age 


ttc 


ctg 


aac 


y a y 


eta 

i_y 


ate 


rcrcr 
u yy 


aca 
y uy 


tac 


era r* 
y ci u 


tan 


y y u 


432 


64 


Glv 


Arg 


Glu 


Ser 


Phe 


Leu 


Asn 


Glu 


Leu 


lie 


Tro 


Ala 


Tvr 

j. y x 


Asp 


Tvr 


Glv 




6S 




130 










135 










140 












67 


y ^y 


cgc 


teg 


aag 


age 


aag 


t*crcr 
uy y 


ccc 


ace 


aag 


eac 


gac 


aac 


ate 


etc 


CTT" CI 

y i-y 


480 


68 


Ala 


Arg 


Ser 


Lys 


Ser 


Lys 


TrD 


Pro 


Thr 


Lys 


His 


Asp 


Asn 


lie 


Leu 


Val 




6Q 


IAS 










150 










155 














71 


L.CL U 


guy 


ciciy 


rr a r*« 
y clo 


^^*-y 


aac 


aac 


tan 


n't" o 


+- rrrr 

uy y 


aac 


y y u 


narr 
v»Giy 


rr ^ +■ 
y CL L. 


y L-u 


rra "t" 
y a l. 


528 


72 


Tvr 
i i i 


Val 


Lys 


Asp 


Pro 


Asn 


Asn 


Tvr 


Val 


TrD 


Asn 


Glv 


Gin 


Asp 


Val 


Asp 




7 ^ 










165 










170 










175 






75 




rra rr 




tan 


a 1" Cf 
a t-y 




ccc 


y y y 


etc 


gtt 


aea 


ccc 


rra rr 

y ay 


day 


y L-Ci 


y uy 


576 


76 


Arcr 


Glu 


Pro 


Tvr 
j 


Met 


Ala 


Pro 


Glv 


Leu 


Val 


Thr 


Pro 


Glu 


Lys 


Val 


Ala 




77 








180 










185 










190 








79 


ctt 


aac* 


aag 


eta 


ccc 


ace 


gac 


gtc 




L yy 


cac 


aea 


ate 


att 


ccg 


cct 


624 


80 


Leu 


Glv 


Lys 


Leu 


Pro 


Thr 


Asp 


Val 


TrD 


TrD 


His 


Thr 


lie 


Val 


Pro 


Pro 




81 






195 










200 










205 










83 


y uy 


age 


aaa 


y ay 


cgc 


acc 


aaa 
y y y 


tac 


y °y 


aea 


cag 


aag 


ccg 


gtc 


acre 


ate 


672 


84 


Ala 


Ser 


Lys 


Glu 


Arg 


Thr 


Glv 


Tvr 


Ala 


Thr 


Gin 


LVS 


Pro 


Val 


Glv 


He 




85 




210 










215 










220 












87 


ate 


cgt 


cgc 


atg 


att 


cag 


gcg 


age 


age 


aat 


gaa 


ggc 


gac 


tgg 


gtt 


ctg 


720 


88 


He 


Arg 


Arg 


Met 


He 


Gin 


Ala 


Ser 


Ser 


Asn 


Glu 


Gly 


Asp 


Trp 


Val 


Leu 




89 


225 










230 










235 










240 




91 


gat 


ttc 


ttc 


get 


ggt 


agt 


ggg 


acg 


acc 


ggc 


gee 


gcg 


gee 


cgc 


cag 


etc 


768 


92 


Asp 


Phe 


Phe 


Ala 


Gly 


Ser 


Gly 


Thr 


Thr 


Gly 


Ala 


Ala 


Ala 


Arg 


Gin 


Leu 




93 










245 










250 










255 






95 


gga 


cgc 


cgt 


ttt 


gtg 


etc 


gta 


gac 


gtc 


aac 


cca 


gaa 


gca 


ate 


gcg 


gta 


816 


96 


Gly 


Arg 


Arg 


Phe 


Val 


Leu 


Val 


Asp 


Val 


Asn 


Pro 


Glu 


Ala 


He 


Ala 


Val 




97 








260 










265 










270 








99 


atg 


gca 


aaa 


egg 


ttg 


gat 


gac 


ggg 


gca 


ttg 


gac 


acc 


age 


gtg 


acg 


ate 


864 



100 Met Ala Lys Arg Leu Asp Asp Gly Ala Leu Asp Thr Ser Val Thr He 

101 275 280 285 

103 gtg cag act ccc cag agt gac cca cga acc gac gga tga 903 

104 Val Gin Thr Pro Gin Ser Asp Pro Arg Thr Asp Gly 

105 290 295 300 

108 <210> SEQ ID NO: 2- 

109 <211> LENGTH: 300 

110 <212> TYPE: PRT 

111 <213> ORGANISM: Micrococcus sp. 

113 <400> SEQUENCE: 2 

114 Met Pro He Ser Thr Val Trp Thr Pro Asp Gly Asp Asp Leu He Val 

115 15 10 15 

117 Glu Ala Asp Asn Leu Asp Phe He Gin Thr Leu Pro Asp Ala Ser Phe 

118 20 25 30 

120 Arg Met He Tyr He Asp Pro Pro Phe Asn Thr Gly Arg Thr Gin Arg 

121 35 40 45 

123 Leu Gin Ser Leu Lys Thr Thr Arg Ser Val Thr Gly Ser Arg Val Gly 

124 50 55 60 

126 Phe Lys Gly Gin Thr Tyr Asp Thr Val Lys Ser Thr Leu His Ser Tyr 

127 65 70 75 80 
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1 OQ 

X 4* J 


7\ en 


A en 


Ala 


XT 1IC 




A on 
nop 


lyx 


Trn 
x x r 




rile 


T on 




IT X Vj 


A T*f¥ 

r\x y 


Leu 


Leu 












85 
























1 10 

J. J £, 


r> 1 n 
ulu 


nXa 


HP rn 

irp 


Arg 


Leu 


Ijcu 


HP Vi t* 


Pro 


A en 




Ala 


Leu 


iyr 


Leu 


IlXo 


Leu 


1 J J 








100 










105 










ilv 






1 

1 J J 


Asp 


iyr 


A T*fT 

r\X y 


Glu 


Val 


Hi q 
ills 


iyr 


Ala 


T,VG 

Ujr o 


Va 1 
val 


Va 1 
val 


Leu 




Ala 
Aid 


Wet 


rue 


J. J \J 






115 










120 










1 Z J 








1 


Vjxy 


Arg 




Ser 


Phe 


Leu 


Asn 


(2 1 ii 


JjcU 


T"| 0 
lie 


Trp 


Ala 
riX d 


Tyr 


Asp 


Tyr 


viiy 


i j ? 




J. JV 










135 










1 AD 










1 A 1 


Hid. 


Arg 


otii 


Lys 


Ser 


Lys 


Trp 


riU 


HP Vi t* 
1 I1X 


Lys 


HIS 


Asp 


Asn 


Tin 

xxe 


Leu 


Val 


1 AO 


IAS 
it j 










150 










1 










lOU 


1 Tt *± 


iyr 


V CLX 


it j o 


Asp 


Pro 


Asn 


Asn 


lyx 


Va 1 

V QX 


irp 


Asn 


uiy 


VjXll 


Asp 


Va 1 
Vdl 


Asp 


IAS 

1** *J 










165 










X / U 










1 7R 

l / J 




1 A7 








Tyr Met 


Ala 


Pro 


\j j. y 


1-1 CIA 


Va 1 

V CIX 


HPVi 

j. ii j. 


Jtr X *J 




Xijr o 


Va 1 
vdl 


Ala 
Ala. 


148 








180 










185 

X O 










1 QO 

X 7 V 






150 




rz"l v 




Leu 


Pro 


Thr 


Asp 


Val 

V CI _l_ 


HP T"Tl 

ixp 


HP y*t"i 

irp 




X 11X 


Tip 
lie 


Va 1 
v dx 


rl vj 




151 






195 










200 










205 
\j -j 








-L J O 


Ala 

n±u 


Opr 
OC1 


i-i y o 


Glu 


Arg 


Thr Gly 




Ala 


Thr* 
i in 


ulll 


Xiy o 


rl U 


Va 1 

V dX 


m v 

vsiy 


Tip 

lie 


1 SA 




210 










215 










ooa 










J. J D 


He 


Arg 


Arg 


Met 


He 


Gin 


Ala 




Cor 

OCX 


A cn 
noli 




oxy 


A cn 
nop 


irp 


Va 1 
vdl 




1 J / 


225 










230 










0 ^ ^ 










Z ft U 


1 

-L -3 17 


Asp 


Phe 


Phe 


Ala 


Gly 


Ser Gly 


T 1 V» t* 
1 XIX 


Tnr 


(2~l \r 

oxy 


Ala 
Aid 


Ala 
Aid 


Ala 
Aid 


Arg 


Pin 
uin 


Leu 


i fin 










245 










0 ^fi 
Z J Vj 










occ 




lOZ 


Gly 


Arg 


Arg 


Phe 


Val 


Leu 


Val 


Asp 


Val 


Asn 


Pro 


Glu 


Ala 


He 


Ala 


Val 


ID J 








260 










265 










270 






1 fiS 

1UJ 


Met 


Ala 


Lys 


Arg 


Leu 


Asp 


Asp 


Gly 


Ala 


Leu 


Asp 


Thr 


Ser 


Val 


Thr 


He 


1DD 






275 










280 










285 










Val 


Gin 


Thr 


Pro 


Gin 


Ser Asp 


Pro 


Arg 


Thr 


Asp 


Gly 










-L D 3 




290 










295 










300 










1 10 


<210> SEQ ID NO: 


: 3 
























173 


<211> LENGTH: 1236 
























1 7A 


<212> TYPE: 


DNA 


























175 


<213> ORGANISM: 


Unknown 






















177 


<220> FEATURE: 


























178 


<223> OTHER 


INFORMATION: 


; Description of 


Unknown 


Organism: ENVIR< 


180 


<220> FEATURE: 


























181 


<221> NAME/KEY: 


CDS 
























182 


<222> LOCATION: 


(1). 


. . (1233) 




















1 ftA 
J. Oft 


<220> FEATURE: 


























185 


<223> OTHER 


INFORMATION; 


: At 


position 594, N 


- G, 


A, 


C or T 




187 


<400> SEQUENCE: 


3 
























188 


atg 


cct 


aca 


ctg 


gat 


tgg 


ccc 


ggt 


aaa 


cag 


tta 


age 


ttc 


cca 


cca 


get 


189 


Met 


Pro 


Thr 


Leu 


Asp 


Trp 


Pro 


Gly 


Lys 


Gin 


Leu 


Ser 


Phe 


Pro 


Pro 


Ala 


190 


1 








5 










10 










15 




192 


acc 


tec 


ttg 


cat 


ctg 


gag 


agt 


gtg 


gtc 


act 


gag 


gga 


gcg 


gag 


tea 


ccg 


193 


Thr 


Ser 


Leu 


His 


Leu 


Glu 


Ser 


Val 


Val 


Thr 


Glu 


Gly Ala 


Glu 


Ser 


Pro 


194 








20 










25 










30 






196 


cct 


aat 


cgt 


ctg 


att 


tgg 


gcg 


gac 


aac 


ctg 


ccg 


eta 


atg 


gta 


gat 


ttg 


197 


Pro 


Asn 


Arg 


Leu 


He 


Trp 


Ala 


Asp 


Asn 


Leu 


Pro 


Leu 


Met 


Val 


Asp 


Leu 


198 






35 










40 










45 









48 



96 



144 
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y 


gaa 


U-CL L. 


gaa 


yy y 


a f\ 3 

ci a. a. 


ate 


era 1~ 


p+- pr 

tty 


ate* 

d 


tap 

Lav/ 


prpp 


aat 
y d u 


p p p 


ppt 


1 Q9 
x y & 


201 


Leu 


Ala 


Glu 


Tvr 

x y x 


Glu 


Glv 


Lys 


lie 


Asp 


Leu 


lie 


Tvr 


Ala 


Asp 


Pro 


Pro 




909 
z u z 




so 




















fiO 












90 A 






ppr 
d t y 


y a l. 


ty U 


a p "f~ 


td L. 


gtg 


gcg 


ty a 


d 1 1 


yy >- 


paf 

td t 


rr rr rr 

yyy 


rr a pr 

gag 


cxat 
y d t 


94.0 


205 


Phe 


Phe 


Thr 


ASp 


Arg 


Thr 




Ala 


Ala 


rtxy 


lie 


Glv 


His 


Glv 


Glu 


A^n 




90fi 

\J O 


U -J 










70 










/ j 










O v 




90ft 

Z \J O 


t ppt 

teg 


cgt 


cgt 


p p a 
ttd 


p j* a 


a P P 


tgg 


parr 


ntt 
ttt 


y ta. 


y dd 


gga 


tat 

td t 


p rr 

d ty 


PTJ* P 

y dt 


gag 


9ft fl 
ZOO 


^ u 7 


Cpr 

OCX 


fix y 




Drn 

IT J-\J 


OX 11 


Th t* 

X 11X 




G1 n 


ucu 


Ala 

f\Xd 


fil n 

V3X U 


G1 v 
vjx y 


T VT* 


TVlT 
± 11X 


Ren 


























QO 










QS 








tgg 


ddy 


y d u 


tta 


y a L 


y dd 


+■ p p 


r>t rr 

ctg 


y cit 


t tt 


1 1 u 


tat 
L-d L. 


p p a 
t td 


p/Tp 

ty t 


P+* PT 

t ty 


y td 


O JO 


91 ^ 

Z X J 


Trn 

iX F 


XI jf o 




XJCSU 


A QD 




iyx 


iJCU 


A QD 


IT lie? 




iyx 


prp. 


A T*Pf 

ax y 


ucu 


Va 1 

V d X 




914 








100 

-L U U 










1 OS 

X \J J 










110 








216 


of pr 

o i-y 


d i— y 


tat 


cga 


ctg 


tty 


y ^cl 


cca 


pap 


y yd 


aprr 
d^y 


p -f- p 


tap 


■H -H pr 

ttg 


pop 


p-f- pr 
\* ty 


J O *x 


217 


Leu 


Met 


Tvr 


Arg 


Leu 


Leu 


Ala 


Pro 


His 


Glv 


Thr 


Leu 


Tvr 
± x x 


Leu 


His 


Leu 




218 






115 










120 










125 










990 

Zi £i \J 


pr a P 


tyy 


pap 
Q. 


y 


aat 




tan 


nt a 

y L.CL 


ty t 


tit a 
y i~d 


p +* pr 

v ^y 




pr a +- 
y d L. 


era pt 

y a y 


o -f- p 
d ut 


t tt 


A ^9 

*X O 4L» 


221 


Asp 


Tro 


His 


Ala 


Asn 


Ala 


Tvr 


Val 


Arg 


Val 


Leu 


Leu 


Asp 


Glu 


lie 


Phe 




999 
z z z 




1 ^0 




















140 

X *t u 












224 


prrr pr 

y y y 


cga 


cag 


^y y 


ttt 


r*t r* 


aac 


cm pt 
yay 




pr +■ p 

y L.v^ 


t pr pr 

tyy 


air 1 

d LL> 


tat 

L>d 1— 


pa p 


yy t 


p p p 

t 


AftO 

*± O \J 


225 


Glv 


Arg 


Gin 


Arg 


Phe 


Leu 


Asn 


Glu 


lie 


Val 


TrD 


lie 


Tvr 


His 


Glv 


Pro 




226 


145 










150 










155 










160 




99R 
z z o 


L-td 


pr pp 
y ^ \— 


a + p 
CI 




ty t 


y 




3 ^ rr 
u d y 


pp;p 

ty t 


Ann 
ddd 


pn +■ 

td L. 


y d t 


a p p 
dot 


atn 
d tt 


tt n 

ttg 


y 1 1 


S9ft 


229 


Ser 


Ala 


lie 


Arg 


Arg 


Ala 


Phe 


Lys 


Arg 


Lys 


His 


Asp 


Thr 


lie 


Leu 


Va 1 

V dX 




230 










165 










1 70 

X / \J 










1 7S 

X f J 






9^9 


L.Q L. 


g tg 


ddd 


prpft - 


pr a 
y CtCl 




L.CI L. 


^pa 


t ut 


du L. 


<r p rr 

gcg 


y d t 


gcg 


y 1 1 


cgt 


caa 


S7fi 
J / o 


233 


Tvr 


Val 


Lys 


Glv 


Glu 


Asn 


Tvr 


Thr 


Phe 


Asn 


Ala 


Asp 


Ala 


Val 


Arg 


Gin 




/ 9 A 

\ / Z O *± 








1 RO 
x O v 




r 






1 RS 

lOJ 










1 QO 








W; ^ Z J O 


cct 


■J- a ^ 




ccg 


age 


acn 


cat 


aag 


acc 


ttr> 
ttt 


get 


tec 


tec 


ccg 


aag 


gec 




W--> 9 "5 7 


Pro 


x J x 


His 


Pro 


Ser 


Xaa 


His 


Lys 


Thr 


Phe 


Ala 


C o >» 

OCX 


OCX 


* X \J 




Al a 










1 QS 

X -7 J 










900 

£i \i \J 










90S 

U J 










940 


yyt 




pr /"T+" 

ggt 


ddy 


gtg 


p ppr 
L^y 


pro -f- 

y ci l. 


pi +- pr 

t ty 


part 

tcty 


p pT p 

ty t 


yy c 


p a a 

ddd 


pr+- pr 

gtg 


POP 

ttt 


gaa 


gac 


fi79 


241 


Glv 


Phe 


Glv 


Lys 


Val 


Pro 


ASp 


Leu 


Gin 


Arg 


Glv 


Lys 


Val 


Pro 


Glu 


Asp 




242 




910 










91 S 

^ X J 










990 
^ ^ u 












244 




■egg 


j j_ 

L-Ct L. 


-U 4- 4- 

L. 1_ L. 


^— %— y 


y '-^ 


rr+- rr 


rrpp 


ppj-f- 


ota 


pn p 


d 


y dd 


p pr f~T 

tgg 


a rtp 

dy t 


pr pr p 

ggc 


790 


245 


j. x p 


Tro 


Tvr 

j. _y x 


Phe 


Pro 


Val 


Val 


Ala 


Arg 


Leu 


His 


Arg 


Glu 


Arg 


Ser 


Glv 




246 


225 










230 










*j •*> ~j 










9d0 




249 


tat 


ccg 


act 


caa 


aag 


cct 


caa 


y v*-* 


tty 


p+- pr 

t Ly 


era cr 
y dy 


p pj-pr 
tyy 


atr 


r*t n 


p -r pr 

o uy 


pr p p 

y ^- 1 




250 


Tvr 


Pro 


Thr 


Gin 


Lys 


Pro 


Gin 


Ala 


Leu 


Leu 


Glu 


Arcr 

n.x y 


lie 


Leu 


Leu 


Ala 




251 










245 










250 










255 






253 


tec 


teg 


aac 


gca 


ggc 


gat 


ctg 


gtg 


gca 


gac 


ttc 


ttc 


tgc 


ggc 


tea 


ggg 


816 


254 


Ser 


Ser 


Asn 


Ala 


Gly 


Asp 


Leu 


Val 


Ala 


Asp 


Phe 


Phe 


Cys 


Gly 


Ser 


Gly 




255 








260 










265 










270 








257 


aca 


acc 


get 


gtg 


gtg 


gca 


gec 


cgt 


ctg 


gga 


egg 


cgc 


ttc 


ctg 


gtc 


aac 


864 


258 


Thr 


Thr 


Ala 


Val 


Val 


Ala 


Ala 


Arg 


Leu 


Gly 


Arg 


Arg 


Phe 


Leu 


Val 


Asn 




259 






275 










280 










285 










261 


gat 


gca 


age 


tgg 


cgc 


gec 


gtt 


cat 


gtg 


aca 


cgc 


aca 


cgc 


ttg 


eta 


cgc 


912 


262 


Asp 


Ala 


Ser 


Trp 


Arg 


Ala 


Val 


His 


Val 


Thr 


Arg 


Thr 


Arg 


Leu 


Leu 


Arg 




263 




290 










295 










300 












265 


gag 


gga 


gta 


agt 


ttc 


act 


ttt 


gaa 


cgc 


cag 


gaa 


act 


ttt 


act 


eta 


cct 


960 



file://C:\Crf3\Outhold\VsrI689343A.htm 



4/16/02 



Page 5 of 9 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/689, 343A 



DATE: 04/16/2002 
TIME: 11:52:53 



Input Set : A:\NEB-181.txt 

Output Set: N:\CRF3\04162002\I689343A.raw 



266 


Glu 
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Thr 


Phe 
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315 
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\j y 
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CL L. O 
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gec 
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y ay 
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att 


270 
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Gin 


Pro 


Leu 


Pro 


Pro 


Asp 


TrD 
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He 


He 
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Gin 
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271 
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uy y 
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caa 
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Arcr 
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Leu 
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Phe 


Tro 


Glu 


Val 


Asp 


Asp 


Gin 


£t f J 








340 










345 










350 






Oil 

£,11 


<-gy 


y CL. L. 


yy c 


a a a 


CL L-V^ 






7\ err* 
ay \^ 


v^y l. 


K*G. U 


r*Pk a 

\*CLCL 


ggc 


T + a 


v_»y o 
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97ft 


Trp 


Asp 


Gly 


Lys 


Tip 

X XC 




A TCI 




fix y 


D-j e 
mo 


\3 JL Ll 


m v 

OX Jf 






Opr 
OCX 


*\xy 


279 






355 










O C (\ 

360 










•a c c 

365 








281 


ctt 


cag 


gag 


cag 


gcg 


ccg 


etc 


tct 


eta 


cca 


ttg 


ace 


ggg 


aat 


gga 


ctg 


282 


Leu 


Gin 


Glu 


Gin 


Ala 


Pro 


Leu 


Ser 


Leu 


Pro 


Leu 


Thr 


Glv 


Asn 




Leu 


283 




370 










375 










380 










285 


ttg 


tgt 


gta 


egg 


gta 


gtg 


age 


cgt 


gaa 


ggg 


gaa 


tac 


tat 


gag 


ttc 


aca 


286 


Leu 


Cys 


Val 


Arg 


Val 


Val 
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Arg 


Glu 
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Glu 


Tyr 


Tvr 
x_y x 


Glu 


Phe 


Thr 


& O f 


385 
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395 
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Ser 


Pro 
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Pro 


Val 
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Phe 












291 










405 










410 














294 


<210> SEQ ID NO: 


: 4 
























4* ZJ -) 


<211> LENGTH: 411 












/ 




C 








296 


<212> TYPE: 


PRT 






\ AM Au^ 












<213> ORGANISM: 


/Unknown 


/ ~ 




















299 


<220> FEATURE: 


























J u u 


<223> OTHER 


INFORMATION: 


: At 


position 198, Xaa = 


any 


amino acid 


302 


<400> SEQUENCE: 


4 
























303 


Met 


Pro 


Thr 


Leu 


Asp 


Trp 


Pro 


Gly 


Lys 


Gin 


Leu 


Ser 


Phe 


Pro 


Pro 


Ala 


304 


1 








5 










10 










15 




306 


Thr 


Ser 


Leu 


His 


Leu 


Glu 


Ser 


Val 


Val 


Thr- 


Glu 


Gly 


Ala 


Glu 


Ser 


Pro 


^07 








20 










25 










30 






*j \j ? 


Pro 


Asn 


Arg 


Leu 


lie 


Trp 


Ala 


Asp 


Asn 


Leu 


Pro 


Leu 




Val 


Asp 


Leu 


310 






35 










40 










45 








312 


Leu 


Ala 


Glu 


Tyr 


Glu 


Gly 


Lys 


Tl a 

xxc 


Asp 


Leu 


He 


Tyr 


Ala 


ASP 


Pro 


Pro 


313 




50 










55 










60 










315 


Phe 


Phe 


Thr 


Asp 


Arg 


Thr 


Tyr 


Ala 


Ala 


Arg 


He 


Gly 


His 


Gly 


Glu 


Asp 


316 


65 










70 










75 










80 


^1 ft 


Ser 


Arg 


Arg 


Pro 


Gin 


Thr 


Trp 


Gin 


Leu 


Ala 


Glu 


Gly 


1 Y r 


Thr 


Asp 


Glu 


319 










85 










90 










95 




321 


Trp 


Lys 


Asp 


Leu 


Asp 


Glu 


Tyr 


Leu 


Asp 


Phe 


Leu 


Tyr 


Pro 


Arg 


Leu 


Val 


322 








100 










105 










110 
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Leu 


Met 


Tyr 


Arg 


Leu 


Leu 


Ala 


Pro 


His 


Gly 


Thr 


Leu 


Tyr 


Leu 


His 


Leu 


325 
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120 










125 
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Trp 


His 


Ala 
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Ala 


Tyr 


Val 


Arg 


Val 


Leu 


Leu 


Asp 


Glu 


He 


Phe 


328 




130 










135 










140 
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Gly 


Arg 


Gin 


Arg 


Phe 


Leu 


Asn 


Glu 


He 


Val 


Trp 


He 


Tyr 


His 


Gly 


Pro 


331 


145 










150 










155 










160 


333 


Ser 


Ala 


He 


Arg 


Arg 


Ala 


Phe 


Lys 


Arg 


Lys 


His 


Asp 


Thr 


He 


Leu 


Val 


334 










165 










170 










175 





1008 



1056 



1104 



1152 



1200 



1236 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:3; N Pos . 594 
Seq#:3; Xaa Pos. 198 
Seq#:4; Xaa Pos. 198 

Use of <220> Feature(NEW RULES); 

Sequence(s) are missing the <220> Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial Sequence" 
or"Unknown". Please explain source of genetic material in <220> to <223> 
section (See "Federal Register," 6/01/98, Vol. 63, No. 104 , pp. 29631-32 ) 
(Sec. 1.823 of new Rules) 
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